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METEOD FOR PURIFYING KERATINOCYTE GROWTH FACTORS 

?\zl<A o£ £hs Invention 

5 The present invention relates to the field of 

protein purification- Specifically, the present 
invention relates to the field of purifying keratinocyte 
growth factors. 

10 " Background of the Invention 

Polypeptide growth factors are important 
mediators of intercellular communication (Rubin et al. 
(1989), Proc. Natl. Acad. Sci. USA, 33:802-806). These 

15 molecules are generally released by one cell type and 
act to influence proliferation of other cell types- 
One family of growth factors is the fibroblast 
growth factors (FGF) . There are currently eight known 
FGF family members which share a relatedness among 

20 primary structures: basic fibroblast growth factor, 
bFGF (Abraham et al. (1986), EMBO J., j£: 2523-2528) ; 
acidic fibroblast growth factor, aFGF (Jaye et al. 
(1986), Science, 211:541-545); int-2 gene product, int-2 
(Dickson & Peters (1987), Mature, 326 :833) : hst/kFGF 

25 (Delli-Bovi et al. (1987), Cell, 33:729-737, and 
Yoshida et al. (1987) , Proc. Natl. Acad. Sci. USA, 
34:7305-7309); FGF- 5 (Zhan et al. (1988), Mol. Cell. 
Biol., 3:3487-3495); FGF-6 (Maries et al. (1989), 
Oncogene, A:335-340); keratinocyte growth factor (Finch 

30 et al. (1989), Science, 24:752-755; Rubin et al. (1989), 
.Proc. Natl. Acad. Sci. USA, 86:802-806; Ron et al. 
(1993) , The Journal of Biological Chemistry, 
233141:2984-2988; and Yan et al. (1991) , Jn Vitro Cell. 
Dev. Biol., 22^:437-438) ; and hi sac tophi 1 in (Habazzettl 

35 et al. (1992), Mature, 153:855-858). 
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Among the FGF family of proteins, keratinocyte 
growth factor (KGF) is a unique effector of non- 
fibroblast epithelial (particularly keratinocyte) cell 
proliferation derived from mesenchymal tissues. The 
5 term "native KGF" refers to a natural human {hKGF) or 

recombinant (rKGF) polypeptide (with or without a signal 
sequence) as depicted by the amino acid sequence 
presented in SEQ ID NO: 2 or an allelic variant thereof . 
[Unless otherwise indicated, amino acid numbering for 

10 molecules described herein shall correspond to that 
presented for the mature form of the native molecule 
(i.e., minus the signal sequence), as depicted by amino 
acids 32 to 194 of SEQ ID NO:2.] 

Native KGF may be isolated from natural 

15 sources. For example, hKGF can be isolated from medium 
conditioned by an embryonic lung fibroblast cell line 
(Rubin et al. (1989) , supra. Three chromatographic 
steps, namely heparin-Sepharostf" (Pharmacia, Piscataway, 
NJ) affinity chromatography, HPLC gel filtration, and 

20 reverse-phase HPLC, were used to obtain a purified hKGF 
preparation. Approximately 6 mg of hKGF were recovered 
from 10 liters of conditioned medium. These 
chromatographic steps only recovered 0.8% total hKGF 
based upon a mitogenic activity assay. A further 

25 example teaches the use of another chromatographic step 
using heparin- Sepha rose 8 * affinity and Mono-S 1 * ion- 
exchange chromatography^ (Pharmacia, Piscataway, NJ) for 
isolation of rKGF produced in bacteria (Ron et al. 
(1993), Journal of Biological Chemistry, 2SJi:2984-2988) . 

30 The properties of keratinocyte growth factors 

suggest a potential for the application thereof as a 
drug for promoting specific stimulation of epithelial 
cell growth. It therefore would be desirable to develop 
a method or methods for obtaining relatively high levels 

35 of homogeneous keratinocyte growth factors to provide 
sufficient quantities of material for comprehensive in 
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vitro and in vivo biological evaluation and for a 
potential therapeutic application. 

It is the object of this invention to provide 
a novel method for the purification of keratinocyte 
5 growth factors. 



Summary of the Invent ion 

The present invention is directed to a first 
10 method for purifying a keratinocyte growth factor (KGF) , 
the method comprising! 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange resin; 

15 c) eluting KGF in an eluate solution from 

the cation exchange resin ; 
d) passing the eluate solution from part (c) 
through a molecular weight exclusion 
matrix; and 

20 e) recovering KGF from the molecular weight 

exclusion matrix. 

The^ invention is further directed to a second 
method for purifying a keratinocyte growth factor (KGF) , 
25 the method comprising: 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange resin; 

c) eluting KGF in an eluate solution from 
30 the cation exchange resin; 

d) performing hydrophobic interaction 
chromatography on the eluate solution of 
part (c) ; and 

e) recovering KGF from the hydrophobic 

35 interaction chromatography step of part 

(d) . 
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Generally, the cation exchange chromatography 
step of the first or second methods may be conducted 
with any suitable buffer (e.g., phosphate buffer saline, 
sodium acetate or tris-HCL) at a pH of preferably 
5 between about 6.8-7.5. Suitable columns for use in this 
step include carboxymethyl cellulose, carboxymethyl 
agarose and sulfated agarose and cellulose columns 
(e.g. , columns of S-Sepharose Fast Flow 1 " resin, Mono-S 5 * 
resin and CM-cellulose™ resin, commercially available 
10 from Pharmacia, Piscataway, NJ) . The flow rate will be 
variable depending upon the column size. 

The gel filtration step of the first method may be 
conducted in any suitable buffer (e.g., phosphate buffer 
saline) at a pH of preferably between about 7.0 and 7i5. 
15 Suitable columns for use in this step include agarose- 
based, acrylamide-based, silica-based or polymer-based " 
size-exclusion columns (e.g.> columns of Sephadex G-75™ 
resin and Superdex-75 TK resin, commercially available 
from Pharmacia) . 
20 In a particularly preferred embodiment of the 

second method, free sulfhydryl groups may be oxidized 
prior to the hydrophobic interaction step, discussed 
below. -Any manner of oxidation may be employed. For 
e x a mp le, the protein may be exposed to atmospheric 
25 oxygen for a suitable period of time. Alternatively, 

various oxidation procedures may be ...employed. One such 
procedure is particularly suited for keratinocyte growth 
factors wherein one or more cysteine residues, as 
compared to the native KGF molecule, are deleted or 
30 replaced. In this procedure an oxidizing agent (e.g., 
cyst amine dihydrochloride or another appropriate 
oxidizing agent, for instance, cystine, oxidized 
glutathione or divalent copper) may be added to a f inal 
concentration, adjusting the pH to preferably between 
35 about 7-9.5, with pH 9.0 ± 0.3*C being more preferred 
when using cystamine dihydrochloride) , and holding the 



W 96/11552 FCT/US95/13099 

- 5 - 

temperature at preferably between about 10-30 *C, for an 
appropriate period. The second procedure may be used 
for oxidizing native KGF and other keratinocyte growth 
factors with comparable patterns of cysteine residues. 
5 In this procedure, oxidation may be accomplished by 
adding an appropriate amount of an ionic strength 
modifier (e.g., (NH4>2S04))# adjusting the pH to 
preferably between about 7.5-9.5, and holding the 
temperature at preferably between about 23 ± 5 - C for an 
10 appropriate period. 

The hydrophobic interaction step of the second 
method may be conducted by using any suitable buffer 
(e.g., sodium phosphate) at a pH of preferably between 
about 6.0-8.0, more preferably about 7.0, and by eluting 
15 with a decreasing linear (NH4)2S04 gradient ranging from 
2-0 M. Suitable columns for use in this step include 
alkyl or phenyl substituted resins • {e.g.., a column of 
Butyl- 65 0M Toyopearl™ resin, commercially available from 
Tosohaas, Inc. , Montgomeryville, PA and columns of 
20 phenyl Sepharose™ resin and phenyl Superose» resin, 
commercially available from Pharmacia) . 

The process of the present invention may be 
used to purify KGF. Thais, it should be understood that 
the terms "keratinocyte growth factor" and "KGF" as 
25 employed in this description ate intended to include, 

and to mean interchangeably unless otherwise indicated, 
native KGF and KGF analog proteins (or "muteins") 
characterized by a peptide sequence substantially the 
same as the peptide sequence of native KGF and by 
30 retaining some or. all of the biological activity of 

native KGF, particularly non-fibroblast epithelial cell 
proliferation (e.g., exhibiting at least about 500-fold 
greater stimulation of BALB/MK keratinocyte cells than 
that of NIH/3T3 fibroblast cells, and at least about 50- 
35 fold greater stimulation of BALB/MK keratinocyte cells 
*-*>*n for BS/589 epithelial cells or for CC1208 
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epithelial cells, as determined by H- thymidine 
incorporation) . By -characterized by a peptide sequence 
substantially the same as the peptide sequence of native 
KGF" is meant a peptide sequence which is encoded by a 
5 DNA sequence capable of hybridizing to nucleotides 201 
to 684 of SEQ ID NO:l,' preferably under stringent 
hybridization conditions . 

The determination of a corresponding amino 
acid position between two amino acid sequences may be 
10 determined by aligning the two sequences to maximize 

matches of residues including shifting the amino and/or 
carboxyl terminus, introducing gaps as required and/or 
deleting residues present as inserts in the candidate. 
Database searches, sequence analysis and manipulations, 
IS may be performed using one of the well-known and 

routinely used sequence homology/identity scanning 
algorithm programs (e.g., Pearson and Lipman (1988), 
Proc. Natl. Acad. Sci. U.S.A., 35:2444-2448; Altschul et 
al. (1990), J. Mol. Biol.. 215:403-410; Lipman and 
20 Pearson (1985), Science, 222 :1435 or Devereux et al. 
(1984), Nuc. Acids Res., 12:387-395). 

Stringent conditions, in the hybridization 
context, -will be stringent combined conditions of salt, 
temperature, organic solvents and other parameters 
25 typically controlled in hybridization reactions. 
Exemplary stringent hybridization conditions- axe 
hybridization in 4 X SSC at 62-67° C, followed by 
washing in 0.1 X SSC at 62-67° C. for approximately an 
hour. Alternatively, exemplary stringent hybridization 
30 conditions are hybridization in 45-55% formamide, 4 X 
SSC at 40-45°C. [See, T. Maniatis et. al., Molecular 
Cloning (A Laboratory Manual) ; Cold Spring Harbor 
Laboratory (1982) , pages 387 to 389] . 

Thus, the proteins include allelic variations, 
35 or deletion(s), substitution (s) or insertion(s) of amino 
acids, including fragments, chimeric or hybrid molecules 
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of native KGF . One example of KGF includes proteins 
having residues corresponding to Cys 1 and Cys 1 * Q f SEQ TD 
NO: 2 replaced or deleted, with the resultant molecule 
having improved stability as compared with the parent 
5 molecule (as taught in commonly owned U. S.S.N . 

08/487,825, filed on July 7, 1995). Another example of 
KGF includes charge -change polypeptides wherein one or 
more of amino acid residues 41-154 of native KGF 
(preferably residues Arg 41 , Gin 43 , Lys 55 , Lys 95 , Lys 128 , 
10 Asn 137 , Gin 138 , Lys" 9 , Arg 144 , Lys* 4 ? , Gin"*, Lys*" or 

Thr 154 ) are deleted or substituted with a neutral residue 
or negatively charged residue selected to effect a 
protein with a reduced positive charge (as taught in 
commonly owned U.S. S.N. 08/323,337, filed on October 13, 
15 1994) . A still further example of KGF includes prote ins 
generated by substituting at least one amino acid having 
a higher loop-forming potential for at least one amino 
acid within a loop-forming region of Asn 115 -His 116 - 
Tyrll7.Asnll8.Thj.119 Q f native KGF (as taught in 
20 commonly owned U.S. S.N. 08/323,473, filed on October 13, 
1994) . • A still yet further example includes proteins 
having one or more amino acid substitutions, deletions 
or additions within a region of 123-133 .(amino acids 
154-164 of SEQ ID NO: 2) of native KGF; these proteins 
25 may have agonistic or antagonistic activity. 

Specifically disclosed proteins include the 
following KGF molecules (referred to by the residue 
found at that position in the mature protein (minus 
signal sequence) set forth in SEQ ID NO:2 r followed by 
30 that amino acid position in parentheses and then either 
the substituted residue or ■-" to designate a deletion): 
C(1,15)S, AN15-AN24, 4N3/C(15)S, 4N3/C(15)-, AN8/C(15)S, 
AN8/C(15)-, C(1,15)S/R(144)E, C (1, 15) S/R(144) Q, 
4N23/R(144)Q, C(1,15,40)S, C (1, 15, 102) S, 
35 C(1,15,102,106)S, AN23/N(137)E, AN23/K{139) E, 

AN23/K(139)Q, iiN23/R(144)A, AN23 /R (144) E, AN23/R(144) L, 
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AN23/K{147)E, AN23/K(147) Q, AN23 /K(153 ) E, AN23/K(153) Q, 
4N23/Q(152)E/K(153)E; R(144)Q and H(116)G. 

As those skilled in the art will also 
appreciate, a variety of host-vector systems may be 
5 utilized to express the KGF protein-coding sequence. 
These include but are not limited to eucaryotic cell 
systems such as mammalian cell systems infected with 
virus (e.g\ , vaccinia virus, adenovirus, etc.); insect 
cell systems infected with virus (e.g*. , baculovirus) ; 

10 microorganisms such as yeast-containing yeast vectors; 
or to procaryotic cell systems such as bacteria 
transformed with bacteriophage DNA, plasmid DNA, or. 
cosmid DNA. The expression elements of these vectors 
vary in their strengths and specificities. Depending , on 

15 the host-vector system utilized, any one of a number of 
suitable transcription and translation elements may be - 
used. 

Once the protein product of KGF expression has 
been isolated, purified and assayed for KGF activity 

20 (using procedures known to those skilled in the art) , it 
may be formulated in a variety of pharmaceutical 
compositions. Typically, such compositions include a 
suitable*/ usually chemically-defined, carrier or 
excipient for the therapeutic agent and, depending on 

25 the intended form of administration, other ingredients 
as well. The composition can include aqueous carriers 
or consist of solid phase formulations in which KGF is 
incorporated into non-aqueous carriers such as 
collagens, hyaluronic acid, and various polymers. The 

30 composition can be suitably formulated to be 

administered in. a variety of ways, including by 
injection, orally, topically, intr anas ally and by 
pulmonary delivery. 
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Brief Description of the Drawings 

Figure 1 shows the nucleotide (SEQ ID N0:1) 
and amino acid (SEQ ID N0:2) sequences of native KGF 
5 (the nucleotides encoding the mature form of native KGF 
is depicted by bases 201 to 684 of SEQ ID NO:l and the 
mature form of KGF is depicted by amino acid residues 32 
to 194 of SEQ ID NO:2) . 

Figures 2A, 2B and 2C show the plasmid maps of 
10 pCFM1156, ". pCFMl656 andpCFM3102, respectively. 

Figure 3 shows the nucleotide (SEQ ID NO: 3) 
and amino acid (SEQ ID NO: 4) sequences of the construct 
RSH-KGF. 

Figure 4 shows the nucleotide (SEQ ID NO: 5) 
15 and amino acid (SEQ ID NO: 6) sequences of the construct 
contained in plasmid KGF. 

Figure 5 shows the chemically synthesized 
OLIGOs (OLIGO#6 through OIiIGO#ll; SEQ ID NO: 12 -17, 
respectively) used to substitute the DNA sequence 
20 between a Kpnl site and an EcoRI site for a Kpnl site 
(from amino acid positions 46 to 85 of SEQ ID No: 6) in 
the construct contained plasmid KGF to produce the 
construct in plasmid KGF (dsd) . 

Figure 6 shows the chemically synthesized 
25 OLIGOs (OLIGO#12 through OI*IGO#24; SEQ ID NO: 18-30, 

respectively) used to construct KGF ( codon optimized) . 

Figure 7 shows the nucleotide (SEQ ID NO: 31) 
and amino acid sequences (SEQ ID NO:32) of C(1,15)S, a 
KGF analog having substitutions of serine for cysteine 
30 at amino acid positions 1 and 15 of native KGF. 

Figure 8 shows the nucleotide (SEQ ID NO: 33) 
and amino acid sequences (SEQ ID NO: 34) of 
C(l, 15) S/R(144)E, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
35 a substitution of glutamic acid for arginine at amino 
acid position 144 of native KGF. 
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Figure 9 shows the nucleotide {SEQ ID NO:35) 
and amino acid (SEQ ID NO: 36} sequences of 
C(1,15)S/R{144)Q, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
5 a substitution of glutamine for arginine at amino acid 
position 144 of native KGF. 

Figure 10 shows the nucleotide (SEQ ID NO: 37) 
and amino acid (SEQ ID NO: 38) sequences of AN15, a KGF 
analog having a deletion of the first 15 amino acids of 
10 the- N- terminus of native KGF. 

Figure 11 shows the nucleotide (SEQ ID NO: 39) 
and amino acid (SEQ ID NO: 40) sequences of AN23 , a KGF 
analog having a deletion of the first 23 amino acids- of 
the N- terminus of native KGF. , 
15 Figure 12 shows the nucleotide (SEQ ID NO: 41) 

and amino acid (SEQ ID NO: 42) sequences of AN23/R (144) Q, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamine 
for arginine at amino acid position 144 of native KGF. 

20 

Description of Specific Embodiments 

: ; Standard methods for many of the procedures 
described in the following examples, or suitable 

25 alternative procedures , ore provided in widely 

recognized manuals of molecular biology such as, for 
example, Molecular Cloning, Second Edition, Sambrook et 
al. , Cold Spring Harbor Laboratory Press (1987) and 
Current Protocols in Molecular Biology, Ausabel et al., 

30 Greene Publishing Associates/ Wiley- Interscience, New 
York (1990).. 

Example 1 : Preparation of DNA Coding for KGF and KGF Analogs 

35 The cloning of the full-length human KGF gene 

(encoding a polypeptide with the secjuence of native KGF) 
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was carried out both by polymerase chain reaction (PCR) 
of FNA from an animal cell and by PCR of chemically 
synthesized (E. coll optimized codon) oligonucleotides 
("OLIGOs") . Both procedures are described below: 
5 PCR amplification using RNA isolated from 

cells known to produce the polypeptide was performed. 
Initially, cells from a human fibroblast cell line 
AG1523A (obtained from Human Genetic Mutant Cell Culture 
Repository Institute For Medical Research, Camden, New 

10 Jersey) were disrupted with guanidium thiocyanate, 
followed by extraction (according to the method of 
Chomyzinski et al. (1987), Anal. Blochem. , 122:156) . 
Using a standard reverse transcriptase protocol for 
total RNA, the KGF cDNA was generated. PCR (PCR#1) 

15 amplif ication of the KGF gene was carried out using the 
KGF cDNA as template and primers OLIGO#l and 0IxIGO#2 
that encode DNA sequences immediately 5 ' and 3 • of the 
KGF gene [model 9600 Thermo cycler (Perkin-Elmer Cetus, 
Norwalk, CT) ; 28 cycles; each cycle consisting of one 

20 minute at 94°C for denaturation, two minutes at 60°C for 
annealing, and three minutes at 72°C for elongation] . A 
small aliquot of the PCR#1 product was then used as 
template for a 'second KGF PCR (PCR#2) amplif ication 
identical to the cycle conditions described above except 

25 for a 50°C annealing temperatut e . For expression 

cloning of the KGF gene, nested PCR primers were used to 
create convenient restriction sites at both f*nris of the 
KGF gene. OLIGO#3 and OLIGO#4 were used to modify the 
KGF DNA product from PCR#2 to include Mlul and BamHX 

30 restriction sites at the 5' arid 3' ends of the gene, 

respectively [PCR#3; 30 cycles; each cycle consisting of 
one minute at 94°C for denaturation , two minutes at 60°C 
for annealing, and three minutes at 72°C for 
elongation] . This DNA was subsequently cut with Mini 

35 and BamHX, phenol extracted and ethanol precipitated. 
It.was then resuspended and ligated (using T4 ligase) 
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into a pCFM1156 plasmid (Figure 2A) that contained a 
•RSH* signal sequence to make construct RSH-KGF (Figure 
3) - The ligation products were transformed (according 
to the method of Hanahan (1983), *7. Mol. Biol., H£:557) 
5 into E. coll strain FM5 (ATCC: 53911) and plated onto 
LB+kanamycin at 28°C. Several transf ormants were 
selected and grown in small liquid cultures containing 
20 jig/mL kanamycin. The RSH-KGF plasmid was isolated 
from the cells of each culture and DNA sequenced. 

10 Because of an internal Ndel site in the KGF gene,, it was 
not possible to directly clone the native gene sequence 
into the desired expression vector with the bracketed 
restriction sites of Ndel and BamHX. This was 
accomplished as a three-way ligation. Plasmid RSH-KGF. 

15 was cut with the unique restriction sites of Bsml and 

SstI, and a -3 kbp DNA fragment (containing the 3 ' end - 
of the KGF gene) was isolated following electrophoresis 
through a 1% agarose gel. A PCR (PCR#4) was carried out 
as described for FCR#3 except for the substitution of 

20 OLIGO#5 for OLIGO#3 . The PCR DNA product was then cut 
with Ndel and Bsml and a 311 bp DNA fragment was 
isolated following electrophoresis through a 4% agarose 
gel. The third fragment used in the ligation was a 1.8 
kbp DNA fragment of pCFM1156 cut with NdeX and SstI 

25 isolated following electrophoresis through a 1% agarose 
gel. Following ligation *(T4 ligase) , transformation, 
kanamycin selection and DNA sequencing, as described 
above; a clone was picked containing the construct in 
Figure 4, and the plasmid designated KGF. Because of an 

30 internal ribosomal binding site that produced truncated 
products, the KGF DNA sequence between the unique Fpnl 
and EcoRX sites was replaced with chemically synthesized 
OLIGOs (OLIGO#6 through 0LIG0#11) to minimize the use of 
the internal start site (Figure 5) . 



35 
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OLIGOS1 (S2Q ID NO:7): 5 " -CAATGACCTAGGAGTAACAATCAAC-3 ■ 

OLIGO#2 ( SEQ ID NO: 8) : 5' -AAAACAAACATAAATGCACAAGTCCA-3 ■ 

OLIGO#3 ( SEQ ID NO: 9) : 5 1 - ACAACGCGTGCAATGACATGACTCCA- 3 • 

OLIGO#4 (SEQ ID NO: 10) : 
5 5 ' - ACAGGATCCTATTAAGTTATTGCCATAGGAA- 3 ■ 

OLIGO#S (SEQ ID NO: 11) : 

5 * -ACACATATGTGCAATGACATGACTCCA- 3 1 
OLIGO#6 (SEQ ID NO: 12) : 

5 ' -CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC-3 f 
10 t OLIGO#7 • { SEQ ID NO : 13 ) : 

5 • -AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTACTGTT- 3 * 
OLIGO#8 (SEQ ID NO:14) : 

5 * -GCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG-3 • 
OLIGO#9 (SEQ ID NO: 15) : 

OLIGO#10 (SEQ ID NO: 16) : 

5 ' -ACAGCAACAGTACGGATTTCCATAATATTGTAGTTGTTTTTCATC- 3 ■ 

OLIGO#ll (SEQ ID NO:17): 

5 * -AATTCAGATTCAACACCTTTGATTGCAACGATACCA-3 1 

20 

The OLIGOs were phosphorylated with T4 
polynucleotide kinase and then heat denatured. The 
single-strandeS- (ss) OLIGOs were then allowed to form a 
ds DNA fragment by allowing the temperature to slowly 
25 decrease to room temperature. 'T4 ligase was then used 
to covalently link both the internal OLIGO sticky-ends 
and the whole ds OLIGO fragment to the KGF plasmid cut 
with Kpnl and EcoRX. The new plasmid was designated 
KGF(dsd). 

30 A completely E. coll, codon-optimized KGF gene 

was constructed by PCR amplification of chemically 
synthesized OLIGOs #12 through 24. 
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5 ' -AGTTTTGATCTAGAAGGAGG-3 ' 
5 * -TCAAAACK3GATCCTATTAA-3 ' 



OI*IGO#12 (SEQ ID NO: 18) 
OI*IGO#13 (SEQ ID NO: IS) 
OLIGO#14 (SEQ ID NO:2Q) 

5 ' -AGTTTTGATCTAGAAGGAGGAATAACATATGTGCAACGACATGAC- 
5 TCCGGAACAGATGGCTACCAACGTTAACTGCTCCAGCCCGGAACGT-3 ' 

OLIGO#15 (SEQ ID NO:21) : 

5 ' - CACACCCGTAGCTACGACTACATGGAAGGTGGTGACATC CGTGTTC - 
GTCGTCTGTTCTGCCGTACCCAGTGGTACCTGCGTATCGACAAA- 3 * 

OLIGO#16 (SEQ ID NO:22) : 
10 5 1 -CGTGGTAAAGTTA^GGTACCCAGGAAATGAAAAACAACTA- 

CAACATCATGGAA^TCCGTAC'l'Gri'GCl' G 'r IXSGTATC Grm CAATCAAA- 3 ' 

OLIGO#17 (SEQ ID NO: 23 ) : 

5 ' -GGTGT T GAATCTGAA>TTCTACCTGGCAATGAACAAAGAAGGTAAAC - 
TGTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAA- 3 * 
15 OLIGO#18 (SEQ ID NO:24) : 

5 1 -CTGATCCTGGAAAACCACTACAACACCTACGCATCroCTAAATGGA- - 
CC^CAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGT-3 ' 

OLIGO#19 (SEQ ID NO:2S) : 

2 0 CACTTCCTGCCGATGGCAATCACTTAATAGGATCCAGTTTTGA-3 • 

OLIGO#20 (SEQ ID NO; 26) : 5 ' -TACGGGTGTGACGTT C CGGG-3 ' 
OLIGO#21 (SEQ ID NO: 27) : 5 1 -CTTTACCA CG TTTGTCGATA-3 ' 
OLIGO#22 . (SEQ ID NO: 28) :S ' -ATTCAACACCTTTGATTGCA-3 1 
OLIGO#23 (SEQ ID NO : 29 ) : 5 ' -CCAGGATCAGTTCTTTGAAG-3 ' 

25 OI,IGO#24 (SEQ ID NO : 3 0 ) : 5 ' -rGAACCGGGATAC CTTTCTGG - 3 ' ' 

OLIGOs #12 through 24 were '^designed so that: 
the entire DNA sequence encoding native KGF was 
represented by OLIGOs from either the "Watson" or the 

30 "Crick" strand and upon PCR amplification would produce 
the desired double-stranded DNA sequence (Figure 6) 
[PCR#5, Model 9600 thezmocycler (Perkin-Elmer Cetus) ; 21 
cycles, each cycle consisting of 31 seconds at 94°C for 
denaturation, 31 seconds at 50°C for annealing, and 31 

35 seconds at 73°C for elongation; following the 21 cycles 
the PCR was finished with a final elongation step of 7 
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minutes] . After PCH amplification, the DNA fragment was 
cut with Xbal and BamHI and the 521 bp fragment ligated 
into the expression plasmid pCFM1156 cut with the same 
enzymes- PCR#5 utilized the outside primers 
5 (100 pmoles/100 pi rxn) OLlGO#12 and OLIGO#13 and 

1 pi/ 100 pi rxn of a KGF template derived by ligation 
(by T4 ligase) of OLIGOs #14 through #19 (OLIGOs#15 
through OLIGOs#18 were phosphorylated with T4 
polynucleotide kinase) using OLIGOs#20 through OLIGOs #2 4 
10 as band-aid oligos {Jayaraman et al. (1992), 

Bioteclmiques, 12:392) for the ligation. The final 
construct was designated KGF ( codon optimized). 

All of the KGF analogs described herein are 
composed in part from DNA sequences found in KGF(dsd) or 
15 KGF (codon optimized), or a combination of the two. The 
sequences are further modified by the insertion into 
convenient restrictions sites of DNA sequences that 
encode the particular KGF analog amino acids made 
utilizing one or more of the above-described techniques 
20 for DNA. fragment synthesis. Any of the analogs can be 
generated in their entirety by either of the above 
described techniques. However, as a part of the general 
OLIGO design optimized E. coli codons were used where 
appropriate, although the presence of E. coli optimized 
25 codons in part or in toto of any of the genes where 
examined did not significantly increase the yield of 
protein that could be obtained from cultured bacterial 
cells • Figures 7 to 12 set forth by convenient example 
particular KGF analog nucleotide and amino acid sequence 
30 constructions: C(1,15)S (Figure 7) ; C(1,15)S/R(144)E 
(Figure 8); C(1,15)S/R(144)Q (Figure 9); AN15 (Figure 
10); AN23 (Figure 11) and AN23/R(144)Q (Figure 12). All 
the KGF analog constructions described herein were DNA 
sequence confirmed. 
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-Ry^TTTolg 2 z Purification from E. coli 

Three different expression plasmids were 
5 utilized in the cloning of the KGF analog genes. They 
were pCFM1156 (ATCC 69702), pCFM1656 {ATCC 69576), and 
pCFM3102 (Figures 2A, 2B and 2C, respectively) . The 
plasmid p3102 can be derived from the plasmid pCFM1656 
by making a series of site directed base changes with 
10 PCR*. overlapping oligo mutagenesis. Starting with the 

Bglll site (pCFM1656 plasmid bp #180) immediately 5* to 
the plasmid replication promoter, PcopB,' and proceeding 
toward the plasmid replication genes, the base pair 
changes aire as follows: 

15 
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PCFM1656 bo # hp in pCTMlggg bo changed to in oCT-M3102 







# 


204 


T/A 


C/G 






# 


428 


A/T 


G/C 






# 


509 


G/C 


A/T 


5 




# 


617 




insert two G, 






# 


677 


G/C 


T/A 






# 


978 


T/A 


C/G 






# 


992 


G/C 


A/T 






# 


1002 


A/T 


C/G 


10 




# 


1005 


C/G 


T/A 






# 


1026 


A/T 


T/A 






# 


1045 


C/G 


T/A 






# 


1176 


G/C 


T/A 






# 


1464 


G/C 


• T/A 


15 




# 


2026 


G/C 


bp deletion 






# 


2186 


C/G 


T/A* 






# 


2479 


A/T 


T/A 




# 


2498-2501 


AGTG 


GTCA 


20 








TCAC 


CAGT 




# 


2641-2647 


TCCGAGC 


bp deletion 










AGGCTCG 




25 


# 


3441 


G/C 


: A/T 




# 


3452 


G/C 


A/T 




# 


3649 


A/T 


T/A 



# 4556 — insert bps 

30 (SEQ ID NO: 43) 5 1 ' 

( SEQ ID NO : 44 ) 5 1 -CTCGAGTGATCACAGCTGGACGTC - 3 * 



As seen above, pCFM1156, pCFM1656 and pCFM3102 
are very similar to each other and contain many of the 

35 same restriction sites. The piasmids were chosen by 

convenience, and the vector DN& components can be easily 
exchanged for purposes of new constructs. The host used 
for all cloning was E. coll strain FM5 (ATCC: 53911) and 
the transformations were carried out . (according to the 

40 method of Hfvnahan (1983), supra) or by electroelution 
with a Gene Pulsex** transection apparatus (BioRad 
Labor a ties, Inc., Hercules, CA) , according to the 
manufacturer's protocol. 

Initially, a small, freshly-cultured inoculum 

45 of the desired recombinant E. coll clone harboring the 
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desired construct on one of the three pCFM vectors was 
started by transferring 0.1 mL of a frozen glycerol 
stock of the appropriate strain into a 2 L flask 
containing 500 mL of Luria broth. The culture was 
5 shaken at 30 *C for 16 hours. Thereafter the culture was 
transferred to a 15 L fermentor containing 8 L of 
sterile batch medium (Tsai, et al. (1987), J\ .Industrial 
Microbiol., 2:181-187). 

Feed batch fermentation starts with the 

10 feeding of Feed # 1 medium (Tsai, ec al . (1987.)/ 

supra). When the OD600 reached 35, expression of the 
desired KGF analog was induced by rapidly raising the 
culture temperature to 37 *C to allow the amplification 
of plasmid. After two hours at 37 "C, the culture 

15 temperature was quickly raised to 42 "C to denature the 

CI repressor and the addition of Feed 1 was discontinued 
in favor of Feed 2, the addition rate of which was 
initiated at 300 mL/hr. Feed 2 comprised 175 g/L 
trypticase-peptone, 87.5 g/L yeast extract, and 260 g/L 

20 glucose. After one hour at 42 *C, the culture 
temperature was decreased to 36 'C. where this 
. temperature was then maintained for another 6 hours . 

c .The fermentation was then halted and the cells 
were harvested by centrifugation into plastic bags 

25 placed within 1 L centrifuge bottles. The cells were 
pelleted by centrifugation at 400 rpm for 60 minutes, 
after which the super natan ts were removed and the cell 
paste frozen at -90 "C. 

Following expression of the various KGF 

30 analogs in E. coli, native KGF, C(1,15)S, 

C(1,15)S/R(144)E, C(1,15)S/R(144)Q, AN15, AN23 , and 
AN23/R(144)Q protein were purified using the following 
procedure. Cell paste from a high cell density 
fermentation was suspended at 4 * C in 0.2 M NaCl, 20 mM 

35 NaFO*, pH 7.5 as a 10-20% solution (weight per , volume) 
using a suitable high shear mixer. The suspended cells 
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were then lysed by passing the solution through a 
homogenizer (AFV Gaul in, Inc., Everett, MA) thr ee times ■ 
The outflowing homogenate was cooled to 4-8 *C by using a 
suitable heat exchanger. Debris was then removed by 
5 centrifuging the lysate in a J-6B 1 * centrifuge (Beckman 
Instruments, Inc., Brea, CA) equipped with a JS 4.2 
rotor at 4,200 rpm for 30-60 min. at 4*C. Supernatants 
were then carefully decanted and loaded onto a 
previously prepared 450 mL (5 cm x 23 cm) column of S- 
10 Sepharose Fast Flow"* resin (Pharmacia) column 

equilibrated with 0.2 M NaCl, 20 mM NaP0 4 , pH 7.5 at 
4 # C. Next, the column was washed with five column 
volumes (2250 mL) of 0.4 M NaCl, 20 mM NaP0 4 , pH 7.5 at 
4*C. The desired protein was eluted by washing the 
15 column with 5 L of 0.S M NaCl, 20 mM NaP0 4 , pH 7.5. 

Again, 50 mL fractions were collected and the A2S0 of the 
effluent was continuously monitored. Fractions 
identified by A28O as containing eluted material were 
then analyzed by SDS-PAGE through 14% gels to confirm 
the presence of the desired polypeptide. 

Those fractions containing proteins of 
interest were then pooled, followed by the addition of 
an equal volume of distilled water. The diluted sample 
was then loaded onto a previously prepared 450 mL (5 cm 
x 23 cm) column of S-Sepharose* Fast Flow equilibrated 
with 0.4 M NaCl, 20 mM NaP0 4 , pH 6.8 at 4*C. The column 
was washed with 2250 mL of 0 .4 M NaCl, 20 mM NaP0 4 , pH 
6.8 and the protein eluted using a 20 column volume 
linear gradient ranging from 0.4 M NaCl, 20 mM NaP0 4 , pH 
6.8 to 0.6 M NaCl, 20 mM NaP0 4 „ pH 6.8. Again, 50 mL 
fractions were collected under constant A28O monitoring 
of the effluent. Those fractions containing the protein 
(determined by 14% SDS-PAGE) were then pooled, followed 
by concentration through a YM-10 membrane (10,000 
molecular weight cutoff) in a 350cc stirring, cell 
(Amicon, Inc. Mayberry, MA) to a volume of .30-40 mL. 
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The concentrate was then loaded onto a 
previously generated 1,300 mL (4.4 an x 85 an) column of 
Superdex-TP* resin {Pharmacia) equilibrated in column 
buffer comprising IX PBS (Dulbecco's Phosphate Buffered 
5 Saline, "D-PBS, ■ calcium and magnesium- free) or 0-15 M 
NaCl, 20 mM NaP04, pH 7.0. After allowing the sample to 
run into the column, the protein was eluted from the gel 
filtration matrix using column buffer. Thereafter, 
10 mL fractions were recovered and those containing the 

10 analog (determined by 14% SDS-PAGE) were pooled. 

Typically, the protein concentration was about 5-10 
mg/mL in the resultant pool. All of the above 
procedures were performed at 4-8 'C, unless otherwise 
specified. . 

15 An alternative purification procedure was used 

to purify native KGF, C(1,15)S and AN23 . The procedure * 
involves the following steps and, unless otherwise 
specified, all procedures, solutions and materials were 
conducted at 23 ± 5*C. 

20 Upon completion of the production phase of a 

bacterial fermentation, the cell culture was cooled to 
4-8 "C and the cells were harvested by centrifugation or 
a similar* process. On the basis of the expected yield 
of protein per unit weight of cell paste and the amount 

25 of purified protein required, an appropriate amount of 
cell paste, by weight, was suspended in a mild buffer 
solution, 20 mM NaP0 4 , 0.2 M NaCl, pH*7.5, weighing 
about five times that of the cell paste to be suspended. 
The cells were dispersed to a homogeneous solution using 

30 a high shear mixer. The temperature of the cell paste 
dispersion was maintained at 4-8 "C during 
homogenizat ion . 

The cells were then lysed by pressure, for 
example by passing the cell paste dispersion twice 

35 through an appropriately sized cell homogenizer. The 

homogenate was kept chilled at 5 + 3°C. To clarify the 
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cell lysate, a previously prepared depth filter housing 
(Cuno, Inc., Meriden, CT) equipped with a filter having 
an appropriate amount of filter surface area, 
equilibrated with a suitable volume of 0.2 M NaCl, 20 mM 
5 NaP04, pH 7.5 was employed. The equilibration and 
clarif ication were performed at 5 + 3 # C. Prior to 
clarification, an appropriate amount of a suitable 
filter aid was used to pre-coat the filter and be 
thoroughly mixed with the cell lysate, after which the 

10 lysate was clarified by passing the solution through the 
filter apparatus. The filter was washed with 0.2 M 
NaCl, 20 mM NaPC>4, pH 7.5. The filtrate and any 
subsequent wash were collected in a chilled container , of 
suitable capacity, all the while being maintained at 

15 less than 10 *C. 

Following clarification the lysate was then 
passed through a previously prepared column of 
SP-Sepharose Fast Flow containing at least 1 mL of resin 
per 2 g of cell paste. The column of SP-Sepharose Fast 

20 Flow was equilibrated with cold (5 ± 3*C) , 0.2 M NaCl, 
20 mM NaP04, pH 7.5. The temperature of the column was 
maintained at less than 10 # C. The clarified lysate 
(5 ± 3*C) was -then loaded onto the ion exchange column, 
with the absorbance at 280 run (A280) of eluate being 

25 continuously monitored. After* sample loading, the 
column was washed with cold 0.2 M NaCl, 20 mM NaP04, 
pH 7.5, followed by washing with 0.3 M NaCl, 20 mM 
NaP04/ pH 7.5 at 23 ± 5'C. 

To elute the desired protein, a linear 

30 gradient ranging from 0.2-1 M NaCl, 20 mM NaP0 4 , pH 7.5 
was used. Bulk, product was collected in several 
fractions on the basis of the A28O of the eluate. 
Following elution, these fractions were pooled and the 
volume noted. 

35 To oxidize free sulfhydryl groups, an 

oxidation step was performed. For proteins with altered 
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cysteine patterns, as compared to native XGF, an 
oxidizing agent (e.g., cystamine d i hydrochloride or 
another appropriate oxidizing agent, for instance, 
cystine, oxidized glutathione or divalent copper) was 
5 added to a final concentration of 1-20 mM and the pH was 
adjusted to 7-9.5, with a pH of 9.0 + 0.3 when cys tamin e 
dihydrochloride was used. The oxidation was conducted 
at 10 - 30 "C for an appropriate period. For the native 
KGF protein, oxidation was accomplished by adding an 

10 appropriate amount of (NH4)2S04 such as 1-2 M (NH4)2S04, 
adjusting the pH to 7.5-9.5, and holding the temperature 
at 23 + 5*C for an appropriate period. 

After oxidation, the pH of the solution was 
adjusted to between 6.5 and 9.5. If necessary, solid , 

15 (NH4 ) 2 SO 4 was added to the solution to a final 

concentration of 2 M. To remove particulates, the 
solution was passed through appropriate clarification 
filters . 

The filtered, oxidized product was then 

20 subjected to hydrophobic interaction chromatography 

(HIC) . The HIC matrice was Butyl-650M Toyopearl ,K resin 
(Tosohaas, Inc., Montgomeryville, PA). The protein- 
containing solution was loaded onto the column, which 
had been previously equilibrated with 2 M (N^^SC^, 0.15 

25 M NaCl, 20 mM NaPC^, pH 7«. 0 . After sample loading, the 
column was washed with 2 M (NH4)2S04, 0.15 M NaCl, 20 mM 
NaP04, pH 7.0. The desired protein was then eluted 
using a decreasing linear (NH4>2S04 gradient ranging from 
2-0 M developed in 0.15 M NaCl, 20 mM NaP04, pH 7.0. 

30 When the desired protein .began to elute, as indicated by 
an increase in the A280 of the eluate, fractions were 
collected. Aliquots of each fraction were then analyzed 
by SDS-PAGE. Those fractions containing the desired 
protein were then pooled, thoroughly mixed, and the 

35 volume of the pool determined, as was the concentration 
of the protein therein. 
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The pooled HIC protein-containing eluate was 
then concentrated and the elution buffer exchanged. 
Typically, proteins were concentrated to 5. 0-10,0 mg/mL. 
Ultrafiltration was conducted using an ultrafiltration 
5 system equipped with a Pellicon» cassette system 

(Millipore, Inc., Bedford, MA) with an appropriately 
sized cut-off membrane 

After concentration, the sample was 
diafiltered against an appropriate buff er. The 
10 retentate from the concentration step was diafiltered 
"against 0.15 M NaCl, 20 mM NaP0 4 , pH 7.0 until the 
conductivity of the retentate was within 5% of the 
conductivity of the 0.15 M NaCl, 20 mM NaP04, pH 7.0 
solution. 

15 In addition, to remove precipitates and 

bacterial endotoxin that might be present, the 
concentrated diafiltered protein-containing sample was 
passed through a 0.1 pm Posidyne 1 * filter (Pall, Inc., 
Cortland, NY) . After determining the protein 

20 concentration of the solution and on the basis of the 
desired concentration of the final, bulk product, the 
solution was diluted with 0.15 M NaCl, 20 mM sodium 
phosphate, pH 7.0, to the desired final concentration. 
A final aseptic filtration through a 0.22 pm filter, was 

25 then performed as the final bulk product was transferred 
to a pyrogen- free container for storage (at about 5*C) 
for further formulation. 

Example 3 = Purification from Mammalian Cell Culture 

30 

This example describes the expression, 
isolation, and characterization of two biologically 
active recombinant KGF (rKGF) forms produced in a 
mammalian expression system. 
35 The human KGF gene was isolated by PCR 

amplif ication of cDNA made from normal dermal human 



WO 96/11952 



- 24 - 



PCTAJS95O3099 



fibroblast cells (Clonetec, Inc., Palo Alto, CA) . 
Following tlie making of cDNA by reverse transcriptase, 
PCR was used to amplify the KGF gene. OLIGO#25 and 
OLIGO#26 were used to amplify the gene out of the cDNA 
5 and OLIGO#27 and OLtIGO#28 were used to place HindCII and 
BglXZ restriction sites at the fragment ends by a second 
PCR amplif ication, as set forth in Figure 1. 



OLIGO#25 (SEQ ID NO: 45) 

10 OLIGO#26 (SEQ ID KO:46) 

OLIGO#27 (SEQ ID NO: 47) 

OLIGO#28 (SEQ ID NO: 48) 



5 ' -CAATCTACAATTCACAGA- 3 • 

5 1 -TTAAGTTATTGCCATAGG-3 ' 

5 * - AACAAAGCTTCTACAATTCACAGATAGGA- 3 * 

5 * -AACAAGATCTTAACSTTATTGCCATAGG-3 * 



Following cloning and DNA sequence 
15 confirmation, the KGF gene DNA was then used. i 
Amplification was effected using two primers: 

OLIGO#29 (SEQ. ID. NO:49): 

5 ' -CGGTCTAGACCACCATGCACAAATGGATACTGACATGG- 3 ' 

20 OLIGO#30 (SEQ. ID. NO: 50) : 

5 1 -GCCGTCGACCTATTAAGTTATTGCCATAGGAAG- 3 1 

The s ens e primer, OLIGO#29, included an -Xbal 
site and* a consensus Kozak translation sequence (5 f - 

25 CCACC-3') upstream of the start codon, ATG. The 

antisense primer, OLIGOS30, included a Sail cloning 
site and an additional stop codon. After 18 cycles of 
PCR amplif ication (30 sec. denaturation at 94 *C, 40 sec. 
ann ealing at 55 *C, and 40 sec. elongation at 72 *C), the 

30 product was digested with XbaZ and Sail and ligated with 
a similarly digested DNA. of pDSRa2 (according to the 
methods of Bourdrel et al. (1S93), Protein Exp. & 
Purif. , 4:130-140 and Lu et al. (1992), Arch. Biochem. 
Biophys., 298 :150-158) . This resulted in plasmid 

35 KGF/pDSRa2 which placed the human KGF gene between the 
SV40 early promoter and the a-FSH polyadenylation 
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sequences. Two clones were picked and DNA sequence 
analysis confirmed construction of the desired vector. 

Two micrograms of KGF/pDSRa2 DNA were then 
linearized with PvuX. Chinese hamster ovary (CHO) 
5 cells, seeded the day before at 0.8 x 10 s cells/60 mm 

culture dish, were then transfected with the treated DNA 
using a standard calcium phosphate precipitation method 
(Bourdrel et al., supra) . Two weeks later, individual 
colonies were picked and transferred into 2 4 -well 
10 plates- The conditioned media was considered serum- free 
when the cells reached, conf luency and aliquots thereof 
were analyzed by Western blotting using a polyclonal 
rabbit antiserum reactive against E. eoli-expressed 
human KGF. 

15 Westerns were performed by running samples 

through 12.5% (w/v) SDS polyacrylamide gels, followed by 
electroblotting for 1 hr. at 400 mA : onto nitrocellulose 
membranes using a semidry transfer apparatus (Hoefer 
Scientific Instruments, San Francisco, CA) . 20 mM Tris, 

20 150 mM glycine, 20% methanol served as the transfer 
buffer. The nitrocellulose sheets were blocked by 
incubation with 10% normal goat serum in PBS. Rabbit 
anti-serum raided against E- coli-derived KGF was used 
as pri m a r y antibody. For use, it was diluted 1/10,000 

25 in 1% normal goat serum in PBS 'and incubated with the 
blocked nitrocellulose sheets for 12 hr. at room 
temperature , after which excess antibody was removed by 
three 30 min. washes in PBS. The nitrocellulose 
membranes were then incubated in 100 mil of 1% normal 

30 goat serum in PBS containing Vectastain 1 * biotinylated 
goat anti -rabbit IgG (secondary antibody. Vector Labs, 
Burlingame, CA) , for 30 minutes at room temperature. 
After three 10 minute washes in PBS, a 30 minute room 
temperature incubation was performed in a 100 mL 

35 solution of 1% normal goat serum containing streptavidin 
and biotinylated peroxidase, prepared according to 
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manufacturer- 1 s directions (Vector Labs) . Following 
three washes in PBS, KGF cross-reactive material was 
visualized by incubation in a mixture of 60 jiL of 30% 
(w/v) H2O2 in 100 mL of PBS and 50 zng of 4 -chlor onap thol 
5 in 20 mL of methanol. The reaction was stopped by 
rinsing in water after 10 minutes. 

Analysis of the blots revealed that the KGF- 
specific antibody associated with three distinct protein 
bands, two being closely related with molecular weights 

10 of about 25-29 kDa and one with an estimated molecular 
weight of about 17 kDa, as compared to the expected 
molecular weight of approximately 18.8 of the 163 amino 
acid mature protein. Additionally, several high 
expressing clones secreting more than 2.0 mg of rKGF per 

15 liter, as judged by Western analysis, were selected and 
expanded into roller bottles (according to the method of 
Lu et al. , supra) to generate large volumes of serum- 
free conditioned medium for purification of KGF by 
cat ionic exchange chromatography and gel filtration, as 

20 set forth below. 

KGF from 3 L of serum- free conditioned medium 
was purified applying the medium directly to a cation 
exchange column (5 x 24 cm) packed with 450 mL of 
sulfoethyl column of SP-Sepharose Fast Flow (Pharmacia) 

25 pre-equilibrated with 20 mM sodium phosphate, pH 7.5. 
After washing with five column volumes of 20 mM sodium 
phosphate, 0.2 M NaCl, pH 7.5, rKGF was eluted using a 
20 column volume linear gradient of 0.2 to 1.0 M NaCl in 
20 mM sodium phosphate, pH 7.5. 50 mL fractions were 

30 collected with continuous. A28O monitoring. KGF protein 
was detected by analyzing aliquots of each fraction by 
SDS-PAGE. SDS-PAGE was performed on an electrophoresis 
system (Novex, San Diego, CA) using precast 14% 
Tris-glycine precast gels (according to the method of 

35 Laemmli (1970), .Mature, 2222 680-685). Samples were 
mixed with non-reducing SDS sample buffer without 
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heating before loading. The proteins were detected by 
either Coomassie blue or silver staining. Two late- 
eluting peaks were seen to contain protein bands 
corresponding to the 25-29 kDa and 17 kDa bands detected 
5 by Western blot. The fractions containing each of these 
peaks were separately concentrated to a volume of less 
than 1.0 mL and subjected to gel filtration. 

The gel f iltrations employed columns of 
Superdexr75 T » i resin (HR 10/3 0, Pharmacia) pre^ 
10 equilibrated with PBS, pH 7.2, and calibrated with the 
following known molecular weight standards (BioRad, San 
Francisco, CA) : thyroglobulin (670 kDa) , gamma globulin 
(158 kDa) , ovalbumin (44 kDa) , myoglobin (17 kDa) and 
vitamin B-12 (1.4 kDa) . These purification steps 
15 resulted in an approximate 2000-fold purification of 
rKGF, specifically including a 17 kDa and a 30 kDa 
material, as estimated by silver staining. 

In the instance of the higher molecular weight 
material, rKGF eluted as a major symmetrical peak, which 
was called KGF-a. Upon SDS-PAGE analysis of a lesser 
amount of this material, 3 pg/lane versus 6 tig /lane, two 
bands with a 1-2 kDa molecular weight difference were 
resolved. In the instance of the lower molecular weight 
material., termed KGF-b, gel filtration resulted in a 
protein preparation having the expected mobility. For 
both KGF-a and KGF-b, the overall yield after 
purification was approximately 30-40%. 

Amino acid sequences from KGF-a and KGF-b were 
also analyzed. These analyses were performed on an 
automatic sequencer, (Model 477A or 47 OA, Applied 
Biosystems, Inc., Foster City, CA) equipped with a Model 
12 OA on-line PTH- amino acid analyzer and a Model 900A 
data collection system (according to the method of Lu et 
al. (1991), J. Biol. Chera., 2££: 8102-8107) . Edman 
sequence analysis of KGF-a revealed a major N- terminal 
sequence of Xi-N-D-M-T-P-E-Q-M-A-T-N-V-X 2 -X 3 -S- (SEQ ID 
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NO: 51) . A minor sequence starting from the third N- 
terminal amino acid, aspartic acid, was also present in 
1.6% of the total sequenceable protein. X lf x 2 , and X 3 
were the unassigned due to the absence of 
5 phenyl thiohydantoinyl (PTH) amino acid signals during 
sequence analysis. 

Interestingly, N- terminal sequence analysis of 
KGF-b revealed an N-terminal amino acid sequence of 
S-Y-D-Y-M-E-G-G-D-I-R-V- (SEQ ID NO: 52), indicating that 
10 it is an N-terminally truncated form of KGF that has 

been proteolytically cleaved at the Arg 23 -Ser 24 peptide 
bond. 

To further characterize purified KGF -a and 
KGF-b, the protein was subjected to glycosidases * 

15 (neuraminidase, O-glycanase, and/or N-glycanase) , using 
known techniques (Sasaki et al. (1987), J. Biol. Cftera. , 
2£2.:12059-12076; Takeuchi et al. (1988), *7. Biol. Chem. , 
251:3657-3563; Zsebo et al. (1990), Cell, £1:195-201). 
These data indicate that KGF -a contains N- and O-linked 

20 carbohydrates, although the lower molecular weight form 
of KGF- a probably contains only N-linked sugar. 
Glycosidase treatment did not cause molecular weight 
reduction* f or KGF-b, indicating that the molecule is 
^glycosylated. 

25 

Example 4 : Biological Activity 

Each KGF analog was diluted and assayed for 
biological activity by measuring the [ 3 H] -thymidine 

30 uptake of Balb/MK cells (according to the method of 
Rubin et al. (1989), supra). The samples were first 
diluted in a bioassay medium consisting of 50% customer- 
made Eagle's MEM, 50% customer-made F12, 5 jxg/mL 
transferrin, 5 ng/mL sodium selenite, 0.0005% HSA and 

35 0.005% Ttoeen 20. KGF samples were then added into 
Falcon primeria 96-well plates seeded with Balb/MK 
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cells. Incorporation of [3tt] -Thymidine during DNA 
synthesis was measured and converted to input native KGF 
concentration by comparison to a native KGF standard 
curve. Each of the tested analogs exhibited mitogenic 
S activity - 

Interaction with the KGF receptor was examined 
using isolated KGF receptor membrane preparations 
prepared from Balb/MK mouse epidermal keratinocytes (by 
the procedure described by Massague (19932), J\ Biol. 

10 Chem. , 13614-13620) . Specifically, various forms of 

KGF were diluted with 50 mM Tris-HCl, pH 7.5, containing 
0.2% bovine serum albumin so as to range in 
concentration from 0.8 ng to 100 ng per 50 nL. They 
were individually incubated with the membrane 

15 preparation (75 ng/mL) and 125 I-labeled E. coli-derived 
KGF (1.5 ng) . Receptor binding and conpetition 
experiments were performed at 4'C for 16 hr., after 
which time samples were taken, centrifuged, and washed 
twice with the above diluent buffer to remove unbound 

20 and non- specifically bound, labeled KGF. Samples were 
then counted for the remaining radioactivity. 
Competition curves for receptor binding between KGF 
samples and labeled KCJF were constructed by plotting 
percent uncompetition versus concentrations of each KGF 

25 sample. Radioreceptor assay uncompetition experiments 
indicated that E. coli-derived KGF, KGF-a, and KGF-b 
have similar receptor binding activity. 

While the present invention has been described 
above both generally and in terms of preferred 

30 embodiments, it is understood' that other variations and 
modifications will occur to those skilled in the art: in 
light of the description above. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 



(ii) TITLE OF INVENTION: Method for Purifying Keratinocyte 

Growth Factors 

(iii) NUMBER OF SEQUENCES: S2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Inc. 

<B) STREET: 1840 DeHavilland Drive 

(C) CITY: Thousand. Oaks 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 91320-1789 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,830 

(B) FILING DATE: 

(C) CLASSIFICATION: not yet known 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown. 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CAATCTACAA TTCACAGATA GGAAGAGGTC AATGACCTAG GAGTAACAAT CAACTCAAGA 60 

TTCATTTTCA TTATGTTATT CATGAACACC CGGAGCACTA CACTATAATG CACAAATGGA 120 

TACTGACATG GATCCTGCCA A CTTTG CTCT ACAGATCATG CTTTCACATT ATCTGTCTAG 180 

TGGGTACTAT ATCTTTAGCT TGCAATGACA TGACTCCAGA GCAAATGGCT ACAAATGTGA * 240 

ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA CATGGAAGGA GGGGATATAA 300 



WO 96/11952 



PCT/US9S/13099 



- 31 - 

GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG GATCGATAAA AGAGGCAAAG 360 

TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT GGAAATCAGG ACAGTGGCAG 420. 

TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA TCTTGCAATG AACAAGGAAG 480 

GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA CTTCAAAGAA CTAATTCTGG 540 

AAAACCATTA GAACACATAT GCATCAGCTA AATGGACACA CAACGGAGGG GAAATGTTTG 600 

TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA AACGAAGAAA GAACAAAAAA 660 

CAGCCCACTT TCTTCCTATG GCAATAACTT AATTGCATAT GGTATATAAA GAACCCAGTT 720 

CCAGCAGGGA GA TTTC f T TA AGTGGACTGT TTTCTTTCTT CTCAAAATTT TCTTTCCTTT 780 

TATTTTTTAG TAATCAAGAA AGGCTGGAAA AACTACTGAA AAACTGATCA AGCTGGACTT 840 

GTGCATTTAT GTTTGTTTTA AG 862 
(2) INFORMATION FOR SEQ ID NO:2: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) IYF3JGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Lys Trp lie Leu Thr Trp lie Leu Pro Thr Leu Leu Tyr Arg 
1 5.. 10 15 

Ser Cys Phe His lie lie Cys Leu Val Gly Thr lie Ser Leu Ala Cys 
20 25 30 

■ 

Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys Ser Ser 
35 40 45 

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp He 
50 55 60 

Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp 
65 70 75 80 

Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn 
85 90 95 

He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He Lys Gly 
100 105 110 

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr 
115 120 125 
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Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu 
130 135 140 

Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Glv 
145 150 155 i 6 o 

Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val Arc Gly 
165 170 17 | 

Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala 
180 185 190 

He Thr 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATCGATTTGA TTCTAGAAGG AGGAATAACA TATGAAAAAG CGCGCACGTG CTATCGCCAT 60 

TGCTGTGGCT CTGGCAGGTT TCGCAACTAG TGCACACGCG TGCAATGACA TGACTCCAGA 120 

GCAAATGGCT ACAAATGTGA ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA 180 

CATGGAAGGA GGGGATATAA GAGTGAGAAG ACT CTTCTG T CGAACACAGT GGTACCTGAG 240 

GATCGATAAA AGAGGCAAAG TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT 300 

GGAAATCAGG ACAGTGGCAG TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA 360 

a 

TCTTGCAATG AACAAGGAAG GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA 420 

CTTCAAAGAA CTAATTCTGG AAAACCATTA CAACACATAT GCATCAGCTA AATGGACACA 480 

CAACGGAGGG GAAA TGTTTG TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA 540 

AACGAAGAAA GAACAAAAAA CAGCCCACTT TCTTCCTATG . GCAAT AACTT AATAG 595 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Lys Arg Ala Arg Ala lie Ala lie Ala Val Ala Leu Ala Gly 
1 S 10 15. 

Phe Ala Thr Ser Ala His Ala Cys Asn Asp Met Thr Pro Glu Gin Met 
20 25 30 

Ala Thr Asn Val Asn Cys Ser Ser Pro Glu Arg His Thr Arg Ser Tyr 
35 40 45 

Asp Tyr Met Glu Gly Gly Asp lie Arg Val Arg Arg Leu Phe Cys Arg 

50 ; " 55 -60 

Thr Gin Trp Tyr Leu Arg lie Asp Lys Arg Gly Lys Val Lys Gly Thr 
65 70 75 80 

Gin Glu Met Lys Asn Asn Tyr Asn lie Met Glu lie Arg Thr Val Ala 
85 90 95 

Val Gly lie Val Ala He Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala 
100 105 110 

Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp 
115 120 125 

Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn Thr Tyr Ala 
130 135* 140 

Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val Ala Leu Asn 
145 150 155 160 

Gin Lys Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys 
165. . 170 175 

Thr Ala His Phe Leu Pro Met Ala He Thr 
180 185 

{2) INFORMATION FOR SEQ ZD NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TATGTGCAAT GACATGACTC CAGAGCAAAT GGCTACAAAT GTGAACTGTT CCAGCCCTGA 60 
GCGACACACA AGAAGTTATG ATTACATGGA AGGAGGGGAT ATAAGAGTGA GAAGACTCTT 120 
CTGTCGAACA CAGTGGTACC TGAGGATCGA TAAAAGAGGC AAAGTAAAAG GGACCCAAGA 180 
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GATGAAGAAT AATTACAATA TCATGGAAAT CAGGACAGTG GCAGTTGGAA TTGTGGCAAT 240 
CAAAGGGGTG GAAAGTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA " 300 

GAAAGAATGC AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC 360 

ATATGCATCA GCTAAATGGA CACACAACGG AGGGGAAATG TTTGTTGCCT TAAATCAAAA 420 

GGGGATTCCT GTAAGAGGAA AAAAAACGAA GAAAGAACAA AAAACAGCCC A CTTTC TTCC 480 

TATGGCAATA ACTTAATAG 499 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Cys Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys 
1^5 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 

Tyr Asn lie Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu TSrr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 HO 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 ~ 150 155 160 

Met Ala He Thr 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CAATGACCTA GGAGTAACAA TCAAC 
(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

AAAACAAACA TAAATGCACA AGTCCA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 26 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown * 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9 
ACAACGCGTG CAATGACATG ACTCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACAGGATCCT ATTAAGTTAT TGCCATAGGA A 31 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cBNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACACATATGT GCAATGACAT GACTCCA 27 
(2) INFORMATION FOR SEQ ID NO: 12: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 
(DJ TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCGTATCG ACAAACGCGG CAAAGTCAAG GGCACCC 37 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown' 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SE Q UENC E DESCRIPTION: SEQ ID NO: 13 : 
AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTT 



44 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTGTTGGTA TCGTTGGAAT CAAAGGTGTT GAATCTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCTTGGGTGC CCTTGACTTT GCCGCGTTTG TCGATACGCA GGTAC 
(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5" base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ACAGCAACAG TACGGATTTC CATAATATTG TAGTTGTTTT TCATC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AATTCAGATT CAACACCTTT GATTGCAACG ATACCA 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGTTTTGATC TAGAAGGAGG 20 
(2) INFORMATION FOR SEQ ID NO: 19: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE2JGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCAAAACTGG ATCCTATTAA 20 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: .SEQ ID NO: 20: 
A GTTTTG ATC TAGAAGGAGG AATAACATAT GTGCAACGAC ATGACTCCGG AACAGATGGC 
TACCAACGTT AACTGCTCCA GCCCGGAACG T 



60 
91 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CACACCCGTA GCTACGACTA CATGGAAGGT GGTGACATCC GTGTTCGTCG TCTGTTCTGC 60 
CGTACCCAGT GGTACCTGCG TATCGACAAA 90 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUQTCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGTGGTAAAG TTAAAGGTAC CCAGGAAATG AAAAACAACT ACAACATCAT GGAAATCCGT 60 
ACTGTTGCTG TTGGTATCGT TGCAATCAAA 90 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: » 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SE QUENCE DESCRIPTION: SEQ XD NO: 23: 
GGTGTTGAAT CTGAATTCTA CCTGGCAATG AACAAAGAAG GTAAACTGTA CGCAAAAAAA 
GAATGCAACG AAGACTGCAA CTTCAAAGAA 



60 
90 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTGATCCTGG . AAAACCACTA CAACACCTAC GCATCTGCTA AATGGACCCA CAACGGTGGT 60 
GAAATGTTCG TTGCTCTGAA CCAGAAAGGT 90 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid ; 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ATCCCGGTTC GTG G TAAAAA AACCAAAAAA GAACAGAAAA CCGCTCACTT CCTGCCGATG 60 
GCAATCACTT AATAGGATCC AGTTTTGA 88 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
TACGGGTGT G ACGTTCCGGG 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID JNO:27: 
CTTTA CCACG TTTGTCGATA 20 
(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ATTCAACACC TTTGATTGCA 
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SE QUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
' (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQ UENC E DESCRIPTION: SEQ ID NO: 29: 
CCAGGATCAG TTCTTTGAAG • 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUOSCE. DESCRIPTION: SEQ ID NO : 3 0 : 
GAACCGGGAT AC CTTTC T G G 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTACCTGGCA ATGAACAAAG AAGGTAAACT GTACGCAAAA 300 

AAAGAATGCA ACGAAGACTG CAACTTCAAA GAACTGATCC TGGAAAACCA CTACAACACC 360 

TACGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCGG TTCGTGGTAA AAAAACCAAA AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 
(BJ TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Ser Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Ser 
1 5 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 " 55 60 
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Tyr Asn lie Met Glu He Arg Thr Val Ala Val Gly lie Val Ala lie 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 L10 

lie Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 155 160 

Met Ala He Thr 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 : 

ATGTGCAATG ATATGACTCC TGAACAAATG GCTACCAATG TCAACTGTTC CTCTCCGGAG 60 

CGCCACACCC GGAGTTACGA TTACATGGAA GGTGGGGATA TTCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTA CTGTT G CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
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(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 164 ami no acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Cys Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys 
15 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

lie Asp Lys Arg Gly Lys Val Lya Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 HO 

He Leu GluCAsn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Gin Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr;. Ala His Phe Leu Pro 
145 ISO 1S5 160 

Met Ala He Thr 



(2) INFORMATION FOR SEQ ID NO:35: : 

(i) SEQ UENC E CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
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(xi> SE QU EN CE DESCRIPTION: SEQ ID NO: 35: 

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT C G TT G CAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG* CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Ser Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Ser 
1 5 : 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 » 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Tnr Gin Trp Tyr Leu Arg 
35 40 45 * 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 110 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 



WO 96/11952 



- 46 



FCT/US9S/13099 



Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly lie Pro Val 
130 135 140 

Gin Gly Lys Ly3 Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 155 160 

Met Ala lie Thr 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 
<B)- TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: ; 
ATGTCTTCTC CTGAACGTCA TACGCGTTCC TACGACTACA TGGAAGGTGG TGACATCCGC 60. 

GTACGTCGTC TGTTC T GCCG TACCCAGTGG TACCTGCGTA TCGACAAACG CGGCAAAGTC 120 

AAGGGCACCC AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTTGCTGTT 180 

GGTATCGTTG CAATCAAAGG TGTTGAATCT GAATTCTACC TGGCAATGAA CAAAGAAGGT 240 

AAACTGTACG CAAAAAAAGA ATGCAACGAA GACTGCAACT TCAAAGAACT GATCCTGGAA 300 

AACCACTACA ACACCTACGC ATCTGCTAAA TGGACCCACA ACGGTGGTGA AATGTTCGTT 360 

GCTCTGAACC AGAAAGGTAT CCCGGTTCGT GGTAAAAAAA CCAAAAAAGA ACAGAAAACC 420 

GCTCACTTCC TGCCGATGGC AATCACTTAA 450 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly 
! 5 10 I 5 

Gly Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu 
20 25 30 
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Arg lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asa 
35 40 45 . 

Asn Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly lie Val Ala 
50 55 60 

He Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly 
65 ' 70 75 80 

Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu 
85 90 95 

Leu lie Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr 
100 105 no 

His Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro 
115 120 125 

Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu 
130 135 140 

Pro Met Ala He Thr 
145 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426- base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TCCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA TTATGGAAAT CCGTA C T G TT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTACCTGGC AATGAACAAA GAAGGTAAAC TGTACGCAAA AAAAGAATGC 240 

AACGAAGACT GCAACTTCAA AGAACTGATC CTGGAAAACC ACTACAACAC CTACGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCG 360 

GTTCGTGGTA AAAAAACCAA AAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 
ACTTAA . 



426 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp He Arg Val Arg Arg Leu 
1 S 10 IS 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 25 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
50 55 60 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 - 70 75 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
85 90 95 

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val 
100 105. 110 

Ala Leu Asn^Sln Lys Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys 
115 120 125 

Glu Gin Lys Thr Ala His Phe Lei; Pro Met Ala He Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SE QU E NCE CHARACTERISTICS : 

(A) L EN GT H : 426 base pairs 

(B) TYPE: nucleic acid 

(CJ STRANDEDNESS: unknown : 
(D) TOPOLOGY: unknown 



<ii) MOLECULE TYPE: cDNA 
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{») SEQUENCE DESCRIPTION : SEQ ID NO: 41: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA' TTATGGAAAT CCGTACT G TT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA GAAAGAATGC 240 

AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC ATATGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCT 360 

GTTCAAGGTA AGAAAACCAA GAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 

ACTTAA 426 
(2) INFORMATION FOR SEQ ID NO: 42: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp lie Arg Val Arg Arg Leu 
1.5 X0 15 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 25 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 » 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
50 55 60 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 70 75 . 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
'85 90 95 

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val 
100 105 HO 

Ala Leu Asn Gin Lys Gly He Pro Val Gin Gly Lys Lys Thr Lys Lys 
115 120 125 

Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala He Thr 
130 135 140 
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(2) INFORMATION FOR SEQ ID NO:43 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
GAGCTCACTA GTGTCGACCT GCAG 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: complement (1.-24) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CTGCAGGTCG ACACTAGTGA GCTC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: M 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown . 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
CAATCTACAA TTCACAGA 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
TTAAGTTATT GCCATAGG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
AACAAAGCTT CTACAATTCA CAGATAGGA 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQ U E NC E DESCRIPTION: SEQ ID NO: 48 
AACAAGATCT TAAGTTATTG CCATAGG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
CGGTCTAGAC CACCATGCAC AAATGGATAC TGACATGG 
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(2) INFORMATION FOR SEQ ID NO:50r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCCGTCGACC TATTAAGTTA TTGCCATAGG AAG 33 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown ; 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Xaa Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Xaa Xaa Ser 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Tyr Asp Tyr Met Glu Gly Gly Asp lie Arg Val 
1 5 10 
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WHAT IS CLAIMED IS: 

1. A method for purifying a keratinocyte growth 
factor (KGF) , the method comprising: 

5 a) obtaining a solution comprising KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
. .cation exchange resin; 

10 d) passing the eluate solution from part (c) 

through an appropriate molecular weight 
exclusion matrix; and 
e) recovering KGF from the molecular weight 
exclusion matrix. 

15 

2. The method according to Claim 1 wherein the 
KGF is produced in procaryotic cells. 

3. The method according to Claim 1 wherein the 
20 KGF is produced in E. coli. 

4. The method according to Claim 1 wherein the KGF 
is produced in* mammalian cells. 

25 5 . The method according* to Claim 4 wherein the 

KGF is produced in Chinese hamster ovary cells. 

6. A method for purifying a keratinocyte growth 
factor (KGF) , the method comprising: 
30 a) * obtaining a solution, corner is ing KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
cation exchange resin; 
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d) performing hydrophobic interaction 
chromatography on the eluate solution of part 
(c) ; and 

e) recovering KGF from the hydrophobic 
interaction chromatography step of part (d) . 

7. A method according to Claim 6 further 
comprising oxidation of free sulfhydryl groups in KGF. 

' a. The method according to Claim 6 wherein the 
KGF is produced in procaryotic cells. 

9. The method according to Claim 7 wherein the 
KGF is produced in E. coll. 

10. The method according to Claim 6 wherein the / 
KGF is produced in mammalian cells. 

11. The method according to Claim 10 wherein the 
KGF is produced in Chinese hamster ovary cells. 
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^TCAAG^TAATTA^^ 

' " "~ " ~ ' H- 240 



MRKMTN ZME I R T V A V G I V A l' 



EcoRX 

~ C ** ASGGGT ^ g AA*GTGAAXT CTAT^^ 

K2GKLTAK 



KCVESEFYLA MN " ^ - ~ ' + 300 



H " 1 — ► ' | , « 0 

KBCHEDCMFKEtX tEMHrMT 
Meter 

- ATATGCATCAGCTAAATGg ACACACAACGGaGGGGAA A T G T g TGI I U CCTTAAATCAAAA- 

rASAKWIHNGGEMrVALKQK* 420 

" K3GGGATTCCT ^AAGAGGAAAAAAAACGAAGA U fi a i rmm r-^ ^ j T rc j 7CC _ 
— I ■ ■ 48Q 



GXPVRGEKTKKEQ KT 



A m r z. P 



-TATGGCAATAACTTAATAG 3' 

— > i ■ 503 

M A X T • 
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substitution of Kpnl to EcoRI sequence to make KGF(dsd) 
KpnX 

) ' ■ 0I^G0»6 1 | — 0LIG0#7— — — _ 

5 » C^^IATC GACAAA CGCGGC^ 

3 ' CATGGACGCAT AGCTGTTTGCGCCGTTTCAGTTCCCGTGGGTTCTCTA C I rTTTGS IJU ATGTTA- 

I —C!XG0#9 1 |— OLIGO#10 

-LRIDKRGXVKGTQEHKNNrN- 



— OX^G0#8— ~ i 



5 • -AXTAX GGAAA TCCGTACIGI IGClbX IXjGXATCGTTGCAATCAAAGGTGTTGAATCTG 3 
3 f -TAATACCTTTJU»3CATGACAACGACAACCAXXGCAACG« 5 

—I I OLIGOill 1 

I M E--X RTVAVG XV A I K G. * V £ S E - 



WO 1552 
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KGF (codon optimized) 

Xb+X 

I 0LIGO#12 1 

5 r ACTTTTGATCT AGAAGGACG 3* 

I - OLIGO #14 



5 • AGTTTTGATCTXGAAGGAGGAATAACXTATGTGCAACGACATGACTCCGGAACAGAXGGCT— 



■I I 0LIGO#15- 



— ACCAACGTTAACTGCTCCAGCCCGGAACCTCACACCCGTAGCTACGACTACXTGGAAGGTG— 
3» GGC CCT rG CXSTCTGGgCAT 5' 
I —OLIGO #20 1 



OLIGO#15 — 1 |- 

— GTGACAT CCG T G T TC G I C G I C I G I Tt. JtaCgCTAgCCAgTgSCTAgeTfSggT ATC?»XC AAACG— 

3' ATAGCTGTTTGC- 
! -OLIGC#21 — 



-OLIGO#lS- 



— TGGTAAAGTTAAAGGTACCCAGGAAATGAAAAACAACTACAACXTCATGGAAATCCGTACT- 
-ACCATTTC S» ■ ■ 
1 



•I I ■ ■ 0t>XGO#17- 



r xigfl J rGC AATCAAAG G W t rGA ATCTGAAXTCTXCCTGGCAATGA A CA— 
3 s ACGTTA GI 1 ICCACAACTTA 5' 
I OLIGO#22 1 



0LIG0#17 ■ f | — 

-AAGAAGGTAAACTOTACGCAAAAJUUUPU^XGCAA 

3 'GAAGTTTCTTGACTA— 
I— OtIGO#23 



-OLIGO#18- 



- C CTGGAAAACCACT ACAACACCT ACGC AT C T GC TAAA1 
-<K»CC 5» 



•I I O LIGO#19- 

rAgCCCGGTTCGTGGTAAAAAAACCAAAAAAGAACAGA- 



3* QBXC2 1 rCC ATA GGGCCA XG V 
I OLIGO#24 1 



rAATAGGATCCXGTTTTGA 3* 
3* AATTAXCCTAGGTCAAAXCT 5** 
I ——OLIGO #13 1 
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Figure 7 
KGF CU,15)S 

5 ' ATG7CTAATGATATGACTCCGGXACAGATCGCTAC 

M5HDMT9EQMATNVNS SSPE " 
-CGTCACACGCCTTCCTACGACTACATGGAAGGTGGTGACArCCGCGTACGTCGTCTGTTC— 

RHTRSTDXMSG6DXRVRRL F 
-TGCCSTACCCAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACC^ 



ISO 



CRTQWrLRZDKRGKVXGTQE 

- ATGAAXAAC^UCT ACAAT ATT ATGGAAATCCGT AC TlS'l TGC S S T 'lGGTATCSTTGCAAgC— 

" H * " 1 11 : 1 » ■ •+ 240 

MKN NTNZME Z R T V A V G Z V A X 

-AAAGCTCTTGAATCTGAATTCT^^ 

» ■■»■■■ i » ■ ► 300 

KGVESEFTZ.AMNKEGKI.rAK 

-AAAGA^TGCAACGAAGACTGCAACTTCAAA^ 

; ' » » h h- » 3*o 




420 



rASAKW THHGGEM.FVAX.NQK 

-GGTATC CCGGI T CG I G GTAAAJUUUCCAAAAAJU^^ 

G ZPVRGKK TKKEQKT AHFLP 

-ATGGCAATCACTTAA 3 9 

1 495 

M A Z T • 



480 



WO 96/11952 PCT/US95/13099 

11/15 
Figure 8 
KG? R(144)Q 




MCKDMTP SQMAT 
^GCCACACCCGGAGTTACGATTJICATGGAA^ 
RHTRSYDYMEGGDrRVRR fr 

-TGCCGTACccALaX^rAgcrc^GTATgGAi^AAmgry^ mag- 

, , , H , ^ lfl0 




* , ' — « ► 240 

M X N N Y M I M E I R T V A V G I V A I 

-AAAGGTGTTGAATCTGAATTCTATCTTGCAATGAACAAGGAAGGAA 

> " »■ ■ " ' f " 300 

KGVES ErXLAMNKEGKLTAX 

-AAAGAAXGCAATGAAGATTGTAACTTCAAAGAACTAATTCT 
» ■ ■ ■ > ■ +- h 3 60 

KECNEDCNFXELXX.E HHYNT 

-TATGCATCTGCTAAATGGACCCACAACGGTGGTGA ^ 

— * 1 ¥ » I > 420 

?ASAKWTHHGGEM FVA&HQK 

-GGTAXC CCTG3 rC AAGGTAAGAAAACCAAGAAAGAACXGAAAAC CG CTCA CI ICCTG CCG- 

1 — ■ l ■ ■ -l 1 ~ ■ ■ ■» 480 

G XPVQGXKTXXEQKTAHFLP 

-ATGGCAATCACTTAA 3» 

1 495 

M A X X * 
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Figure 9 
KGT CC1,15)S/R(144)Q 

M3H0MTPEQMATHVNS SSPE " . 
-CSTCACAC GCG1 iCCIA CGACYACAgGGAAGgTGgTGAC AICCGCU TA CGICur U T GrrC- 

R H T R S T D Y M S G 6 D X R V R R j. F ! 120 
-TGCCSTACCCAGTGGTACCTGCGTATCGACAAACGCGGCAAAGTC^ 

' _ * 1 1 > iao 




MK M.MTKZME IRTV.AVGXVA~i + 24 ° 

-*AXC»TGTTGAATCTGAATTCTATCTTGCAATGA*CAAra 

.,, i - » ■ I i , i 300 

KGVESErXX.AMNKSGKLYAlt 

-AAAGAATGCAATGAAGATTGTAACTTCAAAGAACTAATra 

^ 1 1 . ► 360 

KECMEDCHFKELXLENHTMT 

^^^^QCATCTGCTAAA!T Q ^Arf^lP l AgSCTSCTf5AJLa L T a T t gia ; r ru w ■" r ftfrcr?lfiajA- 

> " — ■ ■ * ■ ► 1 ■ ■» I ■ » 420 

TASAKWTHM GGEM rVAI.MQK 

•^GTAgCCCIGI 1%IAACSTXAGAAAACCAAGAAAGAACAGAAAACCGCTCACTTCCTGCC^ 

► ■ ' I ► - » * » 480 

GXPVQGKKTKKEQKTAHFLP 

-ATGGCAAICACTTAA 3» 

► ■ ■ ■ 49S 

M A X T * 
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Figure 10 
KGF AN15 

5 9 ATG rCI IC T CC Xa AACGTCATACG CG T I CC rA CGXCTACATGGAAGGTGGTGACATCCGC- 

i » ' » ■ i k €0 

MSSPSRBTRSTOrMBG GDZR 

-GTACUIOilC It5 T inilGCCGIACCCAGTGGZACCTGCGTATCGACAAAC G CGGCAAAGTC- 

1 . . > i ■■ ► 12C 

VRRLFCRTQWTX.RIOKR G K V 

-AAGGGCACCCAAGAGAXGAAAAACAACTACAATATT^^ IG CTGTT- 

1 1 1 1 -+ iao 

1CGTQEMKHHYNIMEZRTVAV 

-GGTATCGTTGCAATCAAAGGTGTTGAATCTGAAT^ 

— > ■ ■»■■■■ > ( mU 240 

G Z V. .A Z X G V B S E r r . Z. A M M K E G 

• AAACTGT ACGCAAAAAAAGAATGCAACGAAGACTGCAACT 

»■ » ■ > ■ ■ » -i h 300 

XI.rAKKECWEDCNFKEI. ZLE 

^ AACCACTACAAC A CCTACGCATCTGCTAAATGGACCCACAACGGTGGTGAA AI G ITCST ! - 

1 1 1— ► ■ »■ I 360 

HBYHTYA3AKWTBHGGBMFV 

AX.NQKGZPVRGKK T X X E Q X T 

TAA 3* 
- 450 



AHFZ.PMAZT 
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Figure 1 1 . 
KGF 4N23 

5 • ATGTCCTACGACTACATG6AA GGTGG T G ACAXCCGCSTAC GTCGTCTGT I ' lXUC CGTACC- 
i , | ~ — "* 



MSTDTMEGGDIRVRRL rCRT 



— CAGTGGTXCCXGCGTASCGACAAACGCGGCAAAGTGAAGGGGACCCAAGA^ 
v ■ I 1 ■ i "f * 



QHTLRXDKRGXVRGTQEM RN 
-AAC TACA ATATTATGGAAAXCCGTACXGI ZGCIUl rGGTAXCSTTGCAAXCAAAGGTGTT— 

NYNIMEIRTVAVGIVAIXG "v" 
-GAATCTGAATTCTACCTGGCAAXGAACAAAGA^ 

E S C r * L A M N X E G K.Z. T A X X E C 



* 120 



180 



240 



—AACGAAGACTGCAACTrCAAAGAACTGATCCTGGAAAACCACTACAACACCTACGCATCT'* 

» 1 > » ■ * ► 300 



NEOCHFXE 
-GCTJ 



L E 



r m t r a s 



M CC C G — 

2 SO 



AKWTHHGGEMFVALHQXG IP 

- GI rCgTGGTAAAAAAACCAAAAAAGAACAGAAAACCGCTCACl TCCIUC OGAgGGCAATC- 

I > ■■ I : > ♦ 420 

VRGXKTXXEQXTA HFLPKAZ 

-ACTTAA 3* 

42« 

T • 
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Figure 12 
KG? AN23/R(144)Q 
5 ' AgGTCCTACGACTACAgGGAAGGTGGTGA C^^ 

MSro *MEGGDXRVRRz.'fc € ° 



R T 



-CAGTGCTACCTGCSTAgCSACAAA^^ 




IMEIRTVAVGXVAXXGV 
— GAATCTGAAJTCTAIC J! I GCAATGA AT* AGGJ »m» »JLigTf^ym,«r^*XAAf«31AAGAATCC ■ 



120 



180 



E 3 E F TLAMHKSGX Z. T A X K E C- 
-AATGAAGAXTGTAACTTCAAAGAACTAAITCTGGAAAACCAXT^^ 



240 



c m r 



X.EHRXHTTAS 



300 



— <*CTAAATGGACCC ACAA CGG r GG TUAA AXGX ICGXltjCrCT ' i ^ ArrAfSAAAGGTATrrrT— 

1 ^ ' 1 : — ► 360 



AXWT H MGG&MFVA 



H Q X G I P 



V Q G X 

-ACTTAA 3 f 
42fi 



XTRKEQKT A : H F Z> P M A I 



420 
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